Many complex networks exhibit a percolation transition involving a macroscopic connected
component, with universal features largely independent of the microscopic model and the
macroscopic domain geometry. In contrast, we show that the transition to full connectivity is
strongly influenced by details of the boundary, but observe an alternative form of universality.
Our approach correctly distinguishes connectivity properties of networks in domains with
equal bulk contributions. It also facilitates system design to promote or avoid full connectivity
for diverse geometries in arbitrary dimension.



1 Introduction



Random geometric network models comprise a collection of entities called nodes embedded in a region of typically
two or three dimensions, together with connecting links between pairs of nodes that exist with a probability related to
the node locations. They appear in numerous complex systems including in nanoscience [3], epidemiology [4][5], forest
fires [6], social networks [7][8], and wireless communications [9]-[11]. Such networks exhibit a general phenomenon called
percolation [12][13], where at a critical connection probability (controlled by the node density), the largest connected
component (cluster) of the network jumps abruptly from being independent of system size (microscopic) to being
proportional to system size (macroscopic).
Percolation phenomena are closely related to thermodynamic phase transitions where the number of nodes $N$ goes
to infinity and the critical percolation density is largely independent of the system size, shape, and of the microscopic
details of the model; the phenomenon of universality. At the critical point, conformal invariance in two dimensional
networks leads to detailed expressions for the probability of a connection across general regions [14] and more general
connections with conformal field theory [15] and Schramm-Loewner Evolution [16]. Here, we take a different approach
and are concerned with finite networks and with questions related to percolation, but fundamentally different: What
node density ensures a specified probability $P_{fc}$ that the entire network is a single connected component (cluster),
that is, fully connected? How is this probability affected by the shape of the network domain?

These questions are crucial for many applications, including for example the design of reliable wireless mesh
networks. These consist of communication devices (the nodes) that pass messages to each other via other nodes rather
than a central router. This allows the network to operate seamlessly over a large area, even when nodes are moved or
deactivated. A fully connected network means that every node can communicate with every other node through direct
or indirect connections. Mesh networks have been developed for many communication systems, including laptops,
power distribution ("smart grid") technologies, vehicles for road safety or environmental monitoring, and robots in
hazardous locations such as factories, mines and disaster areas [10].
For many applications of random geometric networks including those above, direct connection between two nodes
$i$ and $j$ can be well described by a probability $H_{ij} = H(r_{ij})$, a given function of the distance between the nodes
$r_{ij} = |\mathbf{r}_i - \mathbf{r}_j|$. Often, the nodes are mobile or otherwise not located in advance, hence we assume $N$ uniformly
distributed nodes confined in a specified $d$-dimensional region $\mathcal{V}$ with area ($d = 2$) or volume ($d = 3$) denoted by
$V$. The node density is then defined as $\rho = N/V$. For reference, we will later take $H(r_{ij}) = \exp[-(r_{ij}/r_0)^\eta]$, where
$r_0$ is a relevant length scale, and $\eta$ determines the sharpness of the cut-off. Note that when $\eta \to \infty$ a step function
corresponding to the popular unit disk deterministic model [17] is obtained, where connections have a fixed range
$r_0$. Our derivation however is completely general and only requires that $H_{ij}$ is sufficiently short-ranged compared to
system size. Using this as a basis, we find that contrary to common belief and practice, the geometrical details of the
confined space boundaries (corners, edges and faces) dominate the properties of the percolation transition. Moreover,
the short-range nature of $H_{ij}$ allows us to separate individual boundary components and obtain analytic expressions
for $P_{fc}$ at high densities as a sum over their contributions. We confirm this through computer simulations and argue
that the substantial improvement offered by our main result Eq. (7) can be used to predict, control, optimize or even
set benchmarks for achieving full network connectivity in a wide variety of suitable models and applications involving
finite size geometries.

Figure 1: Isolated nodes shown as black balls concentrate at the boundaries of the domain and particularly near
corners at higher densities. Nodes are placed randomly in a cube, with lighter colors indicating a higher probability
of being in the largest connected component. We use $\eta = 2$, while the side length of the cube is $L = 10 r_0$. There are
500 nodes in (a) and 700 nodes in (b).
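The model described above can be sampled directly. The following is a minimal sketch, not the authors' simulation code: it assumes a cubic domain $[0, L]^d$ and the reference connection function $H(r) = \exp[-(r/r_0)^\eta]$, and the function names `is_fully_connected` and `estimate_pfc` are hypothetical. It places nodes uniformly, draws each link independently, and uses a union-find structure to test whether a single cluster results.

```python
import math
import random


def is_fully_connected(points, r0=1.0, eta=2.0, rng=random):
    """Link each pair with probability exp(-(r/r0)**eta); True if one cluster results."""
    n = len(points)
    parent = list(range(n))  # union-find forest over node indices

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    clusters = n
    for i in range(n):
        for j in range(i + 1, n):
            r = math.dist(points[i], points[j])
            if rng.random() < math.exp(-((r / r0) ** eta)):
                a, b = find(i), find(j)
                if a != b:
                    parent[a] = b  # merge the two clusters
                    clusters -= 1
    return clusters == 1


def estimate_pfc(n_nodes, side, dim=3, trials=200, seed=0, r0=1.0, eta=2.0):
    """Monte Carlo estimate of P_fc for n_nodes uniform in the cube [0, side]**dim."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        pts = [[rng.uniform(0.0, side) for _ in range(dim)] for _ in range(n_nodes)]
        hits += is_fully_connected(pts, r0, eta, rng)
    return hits / trials
```

Each trial costs of order $N^2$ pair checks, so a call such as `estimate_pfc(500, 10.0, trials=20)` (the node count and side length quoted for Fig. 1(a)) is feasible but slow in pure Python; a spatial grid would be the natural optimisation for short-ranged $H$.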



2 Full connection probability 

As in conventional continuum percolation theory [18], we start by utilizing a cluster expansion approach [19] to derive
a systematic perturbative method for determining the full connection probability $P_{fc}$ as a function of density $\rho$.
Formulation of the expansion can be summarized as follows. The probability of two nodes being connected (or not)
leads to the trivial identity $1 = H_{ij} + (1 - H_{ij})$. Multiplying over all links expresses the probabilities $\mathcal{H}_g$ of all
$2^{N(N-1)/2}$ possible graphs $g$,

$$ 1 = \prod_{i<j} \left[ H_{ij} + (1 - H_{ij}) \right] = \sum_{g} \mathcal{H}_g . \tag{1} $$

Collecting terms according to largest cluster size we get

$$ 1 = \sum_{g \in G_N} \mathcal{H}_g + \sum_{g \in G_{N-1}} \mathcal{H}_g + \cdots + \sum_{g \in G_1} \mathcal{H}_g , \tag{2} $$

where $G_n$ is the set of all possible graphs with largest cluster of size $n \in \{1, \ldots, N\}$. The first term on the right hand
side is the probability of being fully connected given a specific configuration of nodes. The average over all random
configurations $\langle \cdot \rangle = V^{-N} \int_{\mathcal{V}^N} \cdot \; d\mathbf{r}_1 \cdots d\mathbf{r}_N$ of this quantity is thus the overall probability of obtaining a fully connected network
$P_{fc}$. Moreover, the main idea conveyed by Eq. (2) is that at high densities, full connectivity is most likely to be broken
by a single isolated node (the $G_{N-1}$ term); this is sufficient detail for most applications. Further corrections incorporate
the probability of several isolated single nodes and smaller clusters of nodes, for which a systematic expansion can be
developed [20].
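For a handful of nodes the expansion can be checked by brute force. The sketch below is illustrative only: it assumes the reference form $H(r) = \exp[-(r/r_0)^\eta]$ and a hypothetical helper name `largest_cluster_distribution`. It enumerates all $2^{N(N-1)/2}$ graphs on a fixed node configuration, weights each by its probability, and groups the weights by largest cluster size, so that the returned values are the successive sums on the right hand side of Eq. (2).

```python
import itertools
import math


def largest_cluster_distribution(points, r0=1.0, eta=2.0):
    """Return {n: total probability of graphs whose largest cluster has n nodes}."""
    n = len(points)
    pairs = list(itertools.combinations(range(n), 2))
    h = [math.exp(-((math.dist(points[i], points[j]) / r0) ** eta)) for i, j in pairs]
    totals = {size: 0.0 for size in range(1, n + 1)}
    # Each graph g is one choice of present/absent for every pair (i, j).
    for present in itertools.product((False, True), repeat=len(pairs)):
        prob = 1.0
        label = list(range(n))  # cluster label of each node
        for (i, j), h_ij, on in zip(pairs, h, present):
            prob *= h_ij if on else 1.0 - h_ij
            if on and label[i] != label[j]:
                old, new = label[j], label[i]
                label = [new if x == old else x for x in label]  # merge clusters
        largest = max(label.count(x) for x in set(label))
        totals[largest] += prob
    return totals
```

The entry for $n = N$ is the probability of full connectivity for that configuration, and the entries sum to one as Eq. (1) requires. The enumeration grows as $2^{N(N-1)/2}$, so it is only practical for about six nodes or fewer.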

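Averaging Eq. (2) over all configurations and noting that to leading order the $N - 1$ cluster is fully connected, and
that all nodes are identical, the first order approximation becomes

$$ P_{fc} \approx 1 - \Big\langle \sum_{g \in G_{N-1}} \mathcal{H}_g \Big\rangle = 1 - N \Big\langle \prod_{j=2}^{N} (1 - H_{1j}) \Big\rangle = 1 - \rho \int_{\mathcal{V}} \Big( 1 - \frac{M(\mathbf{r}_1)}{V} \Big)^{N-1} d\mathbf{r}_1 , \tag{3} $$

where the "connectivity mass" accessible from a node placed at $\mathbf{r}_1$ is given by

$$ M(\mathbf{r}_1) = \int_{\mathcal{V}} H(r_{12}) \, d\mathbf{r}_2 . \tag{4} $$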



Assuming that the volume $V \gg \rho M(\mathbf{r}_1)^2$ for any $\mathbf{r}_1$, which is reasonable if the system is significantly larger than $r_0$
at moderate densities and that the number of nodes $N$ is large, Eq. (3) simplifies to

$$ P_{fc} \approx 1 - \rho \int_{\mathcal{V}} e^{-\rho M(\mathbf{r}_1)} \, d\mathbf{r}_1 . \tag{5} $$



This equation is equivalent to Eq. (8) in Mao and Anderson [21], which was derived for the specific case of a square
domain. Following numerous studies by probabilists and engineers [1][2], these authors however assumed an exponential
scaling of system size $V$ with $\rho$ which essentially renders boundary effects negligible. Scaling the system in such a
way is a common approach, as it corresponds to the limit of infinite density at fixed connection probability; however,
in practice this limit is approached only for unphysically large volumes. In contrast, we do not assume exponential
growth of $V$, and also consider far more general geometries in any dimension $d > 1$.

Without an exponentially growing volume $V$, the behavior of the full connection probability at high densities is
qualitatively different: It is controlled by the exponential in Eq. (5), and hence node positions $\mathbf{r}_1$ where the connectivity
mass is small, that is, near the boundary of $\mathcal{V}$. Thus in contrast to the usual situation in statistical mechanics, the
boundaries (and in particular corners) are important, and we will see they in fact dominate. We illustrate this in
Fig. 1 where nodes are placed randomly inside a cube and an average over a large number of possible graphs gives the
connectivity of each node. Notice that isolated and hard-to-connect nodes shown as dark balls concentrate near the
boundaries of the domain and particularly near corners at higher densities. This observation forms the basis of our
work, and has led to a radically different understanding of connectivity in confined geometries which we now detail
further.
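Eq. (5) can also be evaluated by direct quadrature, which makes the role of the boundary concrete. The sketch below is an illustration under stated assumptions: a cube $[0, L]^3$, the reference $H(r) = \exp[-(r/r_0)^2]$ (that is, $\eta = 2$), and a simple midpoint rule; the names `axis_factor` and `pfc_eq5_cube` are hypothetical. For this choice of $H$ the connectivity mass of Eq. (4) factorises into three one-dimensional Gaussian integrals, each expressible through the error function.

```python
import math


def axis_factor(x, side, r0=1.0):
    """Integral of exp(-((x2 - x)/r0)**2) over x2 in [0, side] (one Cartesian axis)."""
    return 0.5 * r0 * math.sqrt(math.pi) * (math.erf((side - x) / r0) + math.erf(x / r0))


def pfc_eq5_cube(rho, side, r0=1.0, n=40):
    """Midpoint-rule estimate of 1 - rho * integral of exp(-rho * M(r1)) over the cube."""
    h = side / n
    # M(r1) is the product of the three axis factors when eta = 2.
    f = [axis_factor((k + 0.5) * h, side, r0) for k in range(n)]
    total = 0.0
    for fx in f:
        for fy in f:
            for fz in f:
                total += math.exp(-rho * fx * fy * fz)
    return 1.0 - rho * total * h ** 3
```

Deep in the interior each axis factor approaches $r_0 \sqrt{\pi}$, while at a wall it drops to half that value, so the integrand $e^{-\rho M}$ is largest in the corners. The grid spacing should be kept well below $r_0$ and below the decay length of the integrand near the walls; the default `n` is only a starting point.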



3 Boundary effects 

The contributions to the integrals in Eq. (5) come from $\mathbf{r}_1$ at boundary components $B \subset \mathcal{V}$ of dimension $d_B$, for
example the bulk, the faces, and right angled edges and corners of a cube, with $d_B = 3$, 2, 1 and 0 respectively. The
short-range nature of $H_{ij}$ allows us to isolate each boundary component, whilst to leading order the connectivity mass
splits into independent radial and angular integrals, depending only on the local geometry of $B$ and hence

$$ M_B = M(\mathbf{r}_B) = \omega_B \int_0^\infty H(r) \, r^{d-1} \, dr , \tag{6} $$

where $\omega_B$ is the angle ($d = 2$) or solid angle ($d = 3$) subtended by $B$. For example, if $\mathbf{r}_B$ is near a corner of the cube
then $\omega_B = 4\pi/8$, while near an edge $\omega_B = 4\pi/4$, near faces $\omega_B = 4\pi/2$ and $\omega_B = 4\pi$ for the bulk interior.
Hence, from Eq. (5) we see that corner contributions to $P_{fc}$ as a function of $\rho$ are exponentially larger than edge
contributions which are themselves exponentially larger than face contributions etc. This simple argument shows that
the dominant contribution to $P_{fc}$ at high densities comes from the "pointiest" corners.
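As a worked instance of Eq. (6), the radial integral has a closed form for the reference choice $H(r) = \exp[-(r/r_0)^\eta]$: substituting $u = (r/r_0)^\eta$ gives $\int_0^\infty H(r) r^{d-1} dr = r_0^d \, \Gamma(d/\eta)/\eta$. The sketch below assumes that form of $H$ and right angled cube components; the function names are hypothetical. It evaluates $M_B$ this way and includes a crude midpoint-rule check of the same integral.

```python
import math


def radial_integral(d, eta, r0=1.0):
    """Closed form of the integral of exp(-(r/r0)**eta) * r**(d-1) over r in [0, inf)."""
    return r0 ** d * math.gamma(d / eta) / eta


def radial_integral_numeric(d, eta, r0=1.0, r_max=12.0, steps=20000):
    """Midpoint-rule check, truncated at r_max * r0 (adequate for eta >= 1 or so)."""
    h = r_max * r0 / steps
    total = 0.0
    for k in range(steps):
        r = (k + 0.5) * h
        total += math.exp(-((r / r0) ** eta)) * r ** (d - 1)
    return total * h


def cube_solid_angle(d_b):
    """Solid angle seen from a right angled cube component of dimension d_b (0..3)."""
    return 4.0 * math.pi / 2 ** (3 - d_b)


def connectivity_mass(omega, d, eta, r0=1.0):
    """M_B of Eq. (6): angular factor times the radial integral."""
    return omega * radial_integral(d, eta, r0)
```

For $\eta = 2$ in three dimensions this gives $M_B = (r_0 \sqrt{\pi})^3$ in the bulk and one eighth of that at a corner, so the corner exponent $\rho M_B$ is the smallest of the four component types.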

Expanding $H(r_{12})$ about $\mathbf{r}_2$ near the corresponding boundary component we obtain a next to leading order
expansion for $M(\mathbf{r}_B)$ which we can then use to approximately evaluate the integral in Eq. (5). Ignoring exponentially
smaller correction terms and combining all boundary contributions we arrive at our main result

$$ P_{fc} \approx 1 - \rho \sum_{B} G_B V_B \, e^{-\rho M_B} , \tag{7} $$

where $V_B$ is the $d_B$-dimensional "volume" of each component (equal to one in the case of a 0-dimensional corner and
$V$ when $d_B = d$), $G_B$ is a geometrical factor depending on $B$ and implicitly on $H$, and $M_B$ is as in Eq. (6); see examples
below. Notice that Eq. (7) is completely general as we have only assumed a short-ranged $H_{ij}$ and not used its specific
form. Moreover, it also does not depend on using Euclidean distance and holds in any dimension $d > 1$ and geometry
where the lack of connectivity is dominated by a situation involving an $N - 1$ cluster and a single disconnected node.
Hence Eq. (7) is a powerful and useful multi-purpose tool for analyzing full network connectivity at high densities in
a wide variety of suitable models and applications involving finite size geometries.

For example, in the context of single input single output (SISO) wireless communication channels and a Rayleigh
fading model [22], information theory predicts $H(r_{ij}) = \exp[-(r_{ij}/r_0)^\eta]$ with $\eta$ an environment and wavelength
dependent decay parameter equal to 2 for free propagation, increasing to $\eta \approx 4$ for a cluttered environment, while $r_0$
depends on the minimum outage rate threshold. For nodes confined to a cube of side length $L$ and $\eta = 2$ we find
$V_B = L^{d_B}$, $G_B = (2^{2-d_B}/(\pi \rho r_0^2))^{3-d_B}$, and $M_B = (r_0 \sqrt{\pi})^3 2^{d_B-3}$ with contributions from each of the eight corners,
twelve edges, six faces and bulk. However the derivation is general: Once $G_B$ and $M_B$ have been evaluated for these
boundary components (right angled edges etc.) by standard asymptotic analysis of the relevant integrals, they apply
to any geometry with these features and length scales significantly larger than $r_0$. This independence of the large
scale geometry follows from the short-range nature of $H_{ij}$ and is a type of universality allowing for the calculation of
$P_{fc}$ in complex high dimensional geometries without increased difficulty.

Figure 2: (a) Comparison of the full analytic prediction of Eq. (7) (solid curve) with direct numerical simulation of
the random network in a cube of side length $7 r_0$ (jagged curve). The dashed line corresponds to the bulk contribution
(previous theory). (b) Contributions from the bulk (dotted blue, left), faces (red), edges (yellow) and corners (green,
right), together with the total (solid blue) and numerical simulation (black jagged curve), showing the dominance of
the corners at the highest densities and good agreement between theory and simulation at moderate to high densities.
Here it is convenient to plot the outage probability $P_{out} = 1 - P_{fc}$.
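The cube example can be tabulated term by term. The sketch below is a hedged illustration rather than the authors' code: it assumes $\eta = 2$, and it takes the geometrical factors to be those obtained by linearising the connectivity mass about each right angled boundary component, $G_B = (2^{2-d_B}/(\pi \rho r_0^2))^{3-d_B}$, which should be treated as an assumption of this sketch. The names `CUBE_COMPONENTS`, `outage_terms_cube` and `pfc_eq7_cube` are hypothetical.

```python
import math

# Number of boundary components of each dimension d_B in a cube:
# 1 bulk (d_B = 3), 6 faces, 12 edges, 8 corners (d_B = 0).
CUBE_COMPONENTS = {3: 1, 2: 6, 1: 12, 0: 8}


def outage_terms_cube(rho, side, r0=1.0):
    """Per-dimension contributions rho * G_B * V_B * exp(-rho * M_B) to 1 - P_fc."""
    terms = {}
    for d_b, count in CUBE_COMPONENTS.items():
        v_b = side ** d_b  # d_B-dimensional "volume" of one component
        g_b = (2 ** (2 - d_b) / (math.pi * rho * r0 ** 2)) ** (3 - d_b)  # assumed form
        m_b = (r0 * math.sqrt(math.pi)) ** 3 * 2 ** (d_b - 3)  # Eq. (6) for eta = 2
        terms[d_b] = count * rho * g_b * v_b * math.exp(-rho * m_b)
    return terms


def pfc_eq7_cube(rho, side, r0=1.0):
    """High-density estimate of P_fc for a cube, summing all boundary components."""
    return 1.0 - sum(outage_terms_cube(rho, side, r0).values())
```

Calling `outage_terms_cube(rho, 7.0)` over a range of densities gives the four curves whose sum is the outage probability $1 - P_{fc}$; because this is a high-density expansion, the returned value can fall below zero at low densities and should not be read as a probability there.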

The substantial improvement offered by Eq. (7) becomes clear when compared with the "bulk" contribution
corresponding to current conventional wisdom shown in Fig. 2(a) for a network confined to a cube. Fig. 2(b) further
demonstrates the inaccuracy of the bulk model as well as the benefits of including boundary effects when analyzing
network connectivity in confined geometries.

We can go beyond simple geometries restricted to right-angled corners. Consider the case of a two dimensional
triangle with general angles $0 < \omega_B < \pi$, again taking $\eta = 2$. The relevant integrals for this case come to
$M_B = r_0^2 \omega_B / 2$, with $G_B = 4/(\pi \rho^2 r_0^2 \sin \omega_B)$ for the corners and
$G_B = (2^{1-d_B}/(\sqrt{\pi} \rho r_0))^{2-d_B}$ for the edges and bulk and can be generalized easily to
higher dimensions. Fig. 3 shows two triangles chosen to have identical perimeter and area; the connectivity at a given
density differs only due to the corner angles and agrees perfectly with the full theory of Eq. (7). A bulk theory, even
supplemented with edge contributions, is clearly incapable of explaining the difference between the connectivities of
networks in these two triangles. Moreover, such a situation motivates inverse problems, similar to "hearing the shape
of a drum" [23] by attempting to determine the size and shape details of an unknown domain containing a random
network.
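The two triangles of Fig. 3 make a convenient numerical check. The sketch below is illustrative and rests on stated assumptions: $\eta = 2$, a corner mass $M_B = r_0^2 \omega_B/2$ from Eq. (6), and a corner factor $G_B = 4/(\pi \rho^2 r_0^2 \sin \omega_B)$; the function names are hypothetical. It recovers the corner angles from the quoted side lengths, confirms that perimeter and area coincide, and compares the corner terms of Eq. (7).

```python
import math


def triangle_angles(a, b, c):
    """Interior angles opposite sides a, b, c, from the law of cosines."""
    alpha = math.acos((b * b + c * c - a * a) / (2 * b * c))
    beta = math.acos((a * a + c * c - b * b) / (2 * a * c))
    return alpha, beta, math.pi - alpha - beta


def heron_area(a, b, c):
    """Area of a triangle from its side lengths (Heron's formula)."""
    s = 0.5 * (a + b + c)
    return math.sqrt(s * (s - a) * (s - b) * (s - c))


def corner_outage(rho, sides, r0=1.0):
    """Sum of the corner terms rho * G_B * exp(-rho * M_B), with V_B = 1 per corner."""
    total = 0.0
    for omega in triangle_angles(*sides):
        g_b = 4.0 / (math.pi * rho ** 2 * r0 ** 2 * math.sin(omega))  # assumed form
        m_b = 0.5 * r0 ** 2 * omega  # Eq. (6) with eta = 2, d = 2
        total += rho * g_b * math.exp(-rho * m_b)
    return total


RED = (26.88, 15.44, 15.44)   # side lengths in units of r0, from the Fig. 3 caption
BLUE = (8.40, 24.68, 24.68)
```

With these side lengths the bulk and edge terms of Eq. (7) are identical for the two triangles, so any difference in predicted connectivity comes from `corner_outage` alone; the blue triangle has the sharpest corner and hence the larger corner term at high density.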



4 Discussion 

An important aspect of the theory presented here is how it affects the design of real life random geometric networks. 
For wireless mesh networks, the lack of connectivity near the boundaries can be mitigated by increasing the signal 
power, the number of spatial channels, or by constructing a hybrid network with a regular array of fixed nodes 
along the boundaries as well as randomly placed nodes in the interior. In each of these cases, the design can now 
be analyzed given information about the cost and connectivity function $H(r)$ and of course the desired connectivity
region. Conversely, boundary effects can be harnessed to avoid full connectivity if desired. For example in the case 
of forest fires [6] we have a prediction for the number of unburnt regions as a function of the geometric landscape 
and environment parameters (for example angles between fire-lanes and/or natural boundaries), again given a specific 
model for connectivity that depends on the type of vegetation, temperature, moisture content etc. Similar models 
could be devised for the spread of epidemics [4] or mobile phone viruses where boundaries are embedded in a more 
complex (possibly non-Euclidean) space yet $H_{ij}$ is still short-ranged.

We examined connectivity in confined geometries and illustrated the importance of the often neglected boundary
effects. We then derived a general high density expansion Eq. (7) for the probability of full connectivity $P_{fc}$ assuming
only a short-ranged connectivity function relative to system size and showed that it displays universal features allowing
for its easy calculation in complex geometries. This we have confirmed through computer simulations and argued that
our approach is well placed to facilitate efficiency in design in a variety of physical applications ranging from wireless
networks to forest fire-lanes. Appropriate modifications of our theory can aid the understanding of small
boundary-dominated systems such as for example the electrical conduction through carbon nanotubes in a polymer matrix [3]
but possibly also larger systems such as highly connected social and financial networks [7][8].

Figure 3: Corner contributions in triangles with equal area and perimeter: Comparison of theory with direct
simulation, as in Fig. 2. The red triangle has side lengths of 26.88, 15.44 and 15.44 in units of the connectivity length
scale $r_0$, while the blue triangle has side lengths of 8.40, 24.68 and 24.68. The black dashed lines correspond to the
equal bulk (left curve) and bulk+edge (right curve) contributions while neglecting corner contributions. The colored
curves give the total (including crucial corner) contributions for each triangle. Both theory and simulation are plotted,
showing excellent agreement with the numerical simulations (jagged curves) which cover them completely for $\rho > 4$.